Automatic user-adaptive speaking rate selection for information delivery

Authors

  • Nigel Ward
  • Satoshi Nakagawa
Abstract

Today there are many services which provide information over the phone using a prerecorded or synthesized voice. These voices are invariant in speed. Humans giving information over the telephone, however, tend to adapt the speed of their presentation to suit the needs of the listener. This paper presents a preliminary model of this adaptation. In a corpus of simulated directory assistance dialogs the operator’s speed in number-giving correlates with the speed of the user’s initial response and with the user’s speaking rate. Multiple regression gives a formula which predicts appropriate speaking rates, and these predictions correlate (.46) with the speeds observed in good dialogs in the corpus. An experiment with 18 subjects suggests that users prefer a system which adapts its speed to the user in this way.

1. INFORMATION-GIVING BY VOICE

Many commercial telephone dialogs include an information delivery phase, in which the system gives the user information such as a time, a price, a password, directions, a transaction or confirmation number, etc. As far as we know, all IVR and spoken dialog systems today provide information either by playing back a fixed, prerecorded voice, or by using a synthesized voice generated with fixed parameters. With information delivered at a single speed, invariant across users, it will be too fast for some users, such as nonnative speakers, children, and people in noisy environments, and too slow for others, such as business people in a hurry. In terms of time cost, if the speed is too slow there is a clear loss in user time, system time, and connection time; if the speed is too fast there is again a time loss, as the user has to wait for a repetition.

Whereas the other phases of commercial dialogs (the greeting, call routing, caller identification, content understanding) have been well studied, and are indeed key concerns in the interactive voice response (IVR) business and in spoken dialog research, the information delivery phase has received less attention.

∗Ward is currently at the University of Texas at El Paso. Nakagawa is currently at IBM Japan. This work was supported in part by the International Communications Foundation, Tokyo, and by the Japanese Ministry of Education’s Prosody and Speech Processing Project, headed by Keikichi Hirose.
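
The multiple regression mentioned in the abstract can be illustrated with a minimal sketch. It assumes two predictors (the speed of the user's initial response and the user's speaking rate) and fits them to the operator's speaking rate by ordinary least squares. The function names, units, and sample values below are hypothetical; the paper's actual formula and coefficients do not appear in this excerpt.

```python
from math import sqrt

def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_rate_model(response_speeds, user_rates, operator_rates):
    """Fit operator_rate ~ b0 + b1*response_speed + b2*user_rate (least squares)."""
    rows = [[1.0, d, u] for d, u in zip(response_speeds, user_rates)]
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, operator_rates)) for i in range(3)]
    return solve_linear(xtx, xty)

def predict_rate(coeffs, response_speed, user_rate):
    """Predicted speaking rate for one user."""
    return coeffs[0] + coeffs[1] * response_speed + coeffs[2] * user_rate

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

# Made-up illustration data in arbitrary units (NOT from the paper's corpus).
response_speeds = [0.2, 0.5, 0.9, 0.3, 1.2, 0.6]
user_rates = [7.5, 6.0, 5.0, 7.0, 4.5, 6.2]
operator_rates = [6.8, 5.9, 5.1, 6.5, 4.6, 5.8]

coeffs = fit_rate_model(response_speeds, user_rates, operator_rates)
predicted = [predict_rate(coeffs, d, u) for d, u in zip(response_speeds, user_rates)]
r = pearson(predicted, operator_rates)
```

Here `r` plays the role of the correlation between predicted and observed speeds that the abstract reports, although on these invented values it says nothing about the paper's result.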

Similar resources

Automatic User-Adaptive Speaking Rate Selection

Today there are many services which provide information over the phone using a prerecorded or synthesized voice. These voices are invariant in speed. Humans giving information over the telephone, however, tend to adapt the speed of their presentation to suit the needs of the listener. This paper presents a preliminary model of this adaptation. In a corpus of simulated directory assistance dialo...

A New Single-Display Intelligent Adaptive Interface for Controlling a Group of UAVs

The increasing use of unmanned aerial vehicles (UAVs) or drones in different civil and military operations has attracted the attention of many researchers and scientific communities. One of the most notable challenges in this field is supervising and controlling a group or a team of UAVs by a single user. We therefore proposed a new intelligent adaptive interface (IAI) to overcome this challenge. ...

Cross-layer Packet-dependant OFDM Scheduling Based on Proportional Fairness

This paper assumes each user has more than one queue, derives a new packet-dependant proportional fairness power allocation pattern based on the sum of weight capacity and the packet’s priority in users’ queues, and proposes 4 new cross-layer packet-dependant OFDM scheduling schemes based on proportional fairness for heterogeneous classes of traffic. Scenario 1, scenario 2 and scenario 3 lead r...

Adaptive Minimum BER Reduced-Rank Linear Detection for Massive MIMO Systems

In this paper, we propose a novel adaptive reduced-rank strategy for very large multiuser multi-input multi-output (MIMO) systems. The proposed reduced-rank scheme is based on the concept of joint iterative optimization (JIO) of filters according to the minimization of the bit error rate (BER) cost function. The proposed optimization technique adjusts the weights of a projection matrix and a red...

Adaptive Content Delivery: a New Application Area for Media Computing Research

The explosive growth of the Internet has resulted in increasing diversity and heterogeneity in terms of client device capability, network bandwidth, and user preferences. To date, most Web content has been designed with desktop computers in mind, and often contains rich media such as images, audio, and video. In many cases, this content is not suitable for devices like WebTVs, personal digital ...

Publication date: 2002